AD-754  577 


ESTIMATION  OF  A  CONVEX  COMBINATION  OF 
PROPORTIONS 

Larry  H.  Crow 

Army  Materiel  Systems  Analysis  Agency 
Aberdeen  Proving  Ground,  Maryland 

April  1972 


ISJiTilS 


National  Technical  Information  Service 
0.  S.  DEPARTMENT  OF  COMMERCE 

5285  Port  Royal  Road,  Springfield  Va.  22151 


TECHNICAL  REPORT  NO.  56 


ESTIMATION  OF  A  CONVEX  COMBINATION  OF  PROPORTIONS 


Larry  H.  Cron 


April  1972 


sD  D  CV 


POUE 


FE8  2  m 


n 


mstsa  oTSii 

B 


Approved  for  public  release; 
distribution  unlimited. 


U.  S.  ARMY  MATER  IEL  SYSTEMS  ANALYS  IS  AGENCY 
Aberdeen  Proving  Ground,  Maryland 


DISPOSITION 


Destroy  this  report  when  no  longer  needed.  Do  not  return  it  to 
the  originator. 


DISCLAIMER 

The  findings  in  this  report  are  not  to  be  construed  as  an  official 
Department  of  the  Army  position. 


Warning 

Information  and  data  contained  in  this  document  are  based  on 
the  input  available  at  the  time  of  preparation.  The  results  may  be 
subject  to  change  and  should  not  be  construed  as  representing  the 
AMC  position  unless  so  specified. 


\  MC3S1M  isr  1 

rtf 

WI'J  SrtlH* 

1.9 

Mi  i:.1M 

□ 

pit;  f'tn 

□ 

jajinv-.i 

tt  • 

WSWWMtMueiun  cuts 

"  btk  ml  srasit 

••rnyisnm 


DOCUMCNT  CONTROL  DATA  -RAD 

_  ffrcurttr  clM«//lc«lfM  if  fill#,  My  tf  «l<(Mcr  Mrf  Mod 


t.  OMIAINATINA  ACTIVITY  (C*»WR»R 


U.S.  Army  Materiel  Systems  Analysis  Agency 
Aberdeen  Proving  Ground,  Maryland 


on  M*t  W  MMNtf  vRm  At  mmrmlt  fttrt  it  eftttittttfj 


lANIROlITtKCURiTV  CLASSIFICATION 

Unclassified 


ESTIMATION  OF  A  CONVEX  COMBINATION  OF  PROPORTIONS 


4.  OIICRIRTIVI  NOTCS  fiypt  ti N4  Meitiirt  Rtlti) 


»•  AUTHONISI  (ftt&t  MIM,  AitfSt  *NN«J,  IttlMM; 


Larry  H.  Crow 


*.  MKOONT  OATC 

April  1972 


M.  omiiNtTori  mpoiit  Nuuitmii 


».»««jict»o.  AMCMS  Code  2270.1031 


Technical  Report  No.  56 


it.  wowmiiwi  wiLirowy  activity 

U.S.  Army  Materiel  Command 
Washington,  D.C. 


Tins  report  discusses  three  sampling  procedures  available  for  estimating  convex 
combinations  of  proportions  and  illustrates  the  feasibility  and  practical  usefulness 
of  these  procedures.  The  first  procedure  is  an  analogue  to  simple  random  sampling 
from  a  single  population.  The  remaining  two  procedures  are  infinite  population 
analogues  to  optimum  and  proportional  sampling  from  a  single  finite  population.  The 
practical  aspects  of  selecting  a  procedure  are  discussed  and  numerical  examples  arc 
given. 


I«T1.  1  JM  M,  MCN 


Unclassified 


TECHNICAL  REPORT  NO.  56 


ESTIMATION  OF  A  CONVEX  COMBINATION  OF  PROPORTIONS 

Larry  H.  Crow 
April  1972 

Approved  for  public  release; 
distribution  unlimited. 


AMCMS  Code 


U.S.  ARMY  MATERIEL  SYSTEMS  ANALYSIS  AGENCY 
ABERDEEN  PROVING  GROUND,  MARYLAND 


I 


U.S.  ARMY  MATERIEL  SYSTEMS  ANALYSIS  AGENCY 


TECHNICAL  REPORT  NO.  56 


LHCrow/dga 

Aberdeen  Proving  Ground,  Md. 
April  1972 


ESTIMATION  OF  A  CONVEX  COMBINATION  Or  PROPORTIONS 

ABSTRACT 

This  report  discusses  three  sampling  procedures  available  for 
estimating  convex  combinations  of  proportions  and  illustrates  the  feasi¬ 
bility  and  practical  usefulness  of  these  procedures.  The  first  procedure 
is  an  analogue  to  simple  random  sampling  from  a  single  population.  The 
I'eraairing  two  procedures  are  infinite  population  analogues  to  optimum 
and  proportional  sampling  from  a  sing’e  finite  population.  The  practical 
aspects  of  selecting  a  procedure  are  discussed  and  numerical  examples 
are  giver*. 
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ESTIMATION  OF  A  CONVEX  COMBINATION  OF  PROPORTIONS 


! .  INTRODUCTION  AND  SUMMARY 

Interest  in  the  subject  of  this  report  arose  recently  from  an  investigation 
of  appropriate  testing  plans  for  the  Improved  Hawk  and  SAM-D  missiles.  The 
testing  objectives  for  the  two  missile  systems  are  similar  and  involve 
hypothesis  testing  cn  the  true  value  of  a  certain  convex  combination  cf 
proportions.  Aside  from  missile  testing,  however,  parameters  of  this  type 
are  of  general  practical  interest  since  they  occur  naturally  in  a  variety  of 
statistical  problems.  The  purpose  of  this  report  is  to  discuss  sampling  tech¬ 
niques  available  for  estimating  convex  combinations  of  proportions  and  to 
illustrate  the  feasibility  and  practical  usefulness  of  these  techniques. 

Specifically,  three  sampling  procedures  are  discussed  in  Sections  3  and  A, 
The  first  procedure,  it  will  be  shown,  may  be  considered  as  an  analogue  to 
simple  random  sampling  from  a  single  population.  The  remaining  two  pro¬ 
cedures  arc  infinite  population  analogues  to  optimum  and  proportional  sampling 
from  a  single  finite  population.  (See  Reference  1,  Sections  5.?  and  5.5.) 

In  Section  5  the  practical  aspects  of  selecting  a  particular  sampling 
procedure  will  be  discussed  cr.d  in  Section  6  numerical  examples  of  these 
procedures  will  he  given. 

2.  NOTATION  AND  NOMENCLATURE 

A  diahe terras  population  is  defined  to  be  a  population  (finite  or  infinite) 
in  which  a  proportion  p",  say,  of  the  items  have  a  certain  characteristic  and 
^  proportion  1-p'  of  the  items  do  not  have  it.  Throughout  this  report  it  will 
he  assumed  that  there  are  k  infinite  dichotomus  populations  denoted  symbolically 
by  Pj,..,,PK.  ALso,  if  an  item  drawn  at  random  from  one  of  these  populations 
has  the  characteristic  of  interest  then  it  will  be  said  that  8  eueceos  has 
been  realized. 

Let  E  and  VAR  denote  expectation  and  variance,  respectively,  and  let 
be  the  random  variable 


X.  = 
1 


1  if  an  item  drawn  at 

random  from  Pj;  results 
in  a  success 

0  otherwise 


1  =  1,..., k.  It  is  assumed  that  Prob(X^«l)»pi ,  where  0<p.  <  1,  i  k, 


Observe,  also,  that  E(X^)*p^,  and  VAR(X^)«pi (1-p^) ,  i*l,...,k. 


If  f*.  i-1,. 


,k,  are  numbers  such  that  0  <  f^  and 


1=  E  f. , 
1  9 


then 


i*l 


P*  *  ^Pi 
i=l  1  1 


is  a  convex  combination  of  pj,...,p  .  The  results  derived  in  the  remainder 
«f  trus  paper  are  based  on  the  assumption  that  the  f^s  are  known  constants, 

the  r>-  fs  are  unknown  parameters  and  t».. t  p  is  the  parameter  to  be  estimated, 

* 

5.  iUNlX)M  ALLOCATION 

In  considering  an  estimate  of  a  parameter  the  experimenter,  of  course, 
is  mainly  interested  how  ouch  confidence  he  has  that  the  estimate-  will  be 
close  to  the  true  value  of  the  pavametar.  It  would  seem,  therefore,  that 
to  have  a  high  degree  of  confidence  in  an  estimate  of  p  woi*ld  require  that 
sampler  be  taken  from  each  of  the  k  populations.  If  this  ir  true,  then  to 
obtain  a  *:good'?  oitirxtr,  of  p,  when  k  is  largo,  may  be  a  very  formidable  task, 
indeed.  Fortunately,  however,  this  is  not  the  case. 
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A  procedure  shell  be  presented  in  this  section  that  will  enable  an 
experimenter  to  estiMte  p  without  necessarily  having  to  sample  from  each 
of  the  populations.  This  procedure  is  not  detrimental  to  the  interest  of 
the  experimenter  since  the  precision  and  all  other  statistical  properties 
of  the  proposed  estimator  depend  only  on  the  sample  size  and  not  on  the  number 
of  populations.  Other  desirable  properties  of  this  estimator  will,  also, 
he  presented  in  the  sequel. 

Exact  distribution  theory  is  derived  in  this  section,  too,  that  will 
allow  one  to  construct  operating  characteristic  curves  to  test  hypothesis 
on  the  true  value  of  p.  Also,  the  analogy  of  this  procedure  to  simple 
random  sampling  froa  a  single  dichotomus  population  shall  be  discussed. 

The  procedure  is  defined  next. 

Definition  3.1 

Let  n  be  the  total  sample  size.  For  the  rth  observation,  r=l,...,n, 
choose  one  of  the  populations  »*»«,P«C»  with  the  probability  of  selecting  Pi 

equal  to  f.,  i»l . k.  Let  denote  the  population  selected.  Next  choose 

an  item  at  random  from  Pjr  and  observe  whether  or  not  a  success  was  obtained. 
This  procedure  is  called  random  allocation. 

The  analogy  of  this  procedure  to  simple  random  sampling  is  of  particular 
inteiest.  Let  P  be  a  dichotomus  population  and  suppose  that  the  proportion 
of  items  in  P  that  have  the  characteristic  of  interest  varies.  Specifically, 
assume  that  at  any  Riven  instant  the  probability  is  f^  that  the  proportion  of 
items  in  P  with  the  characteristic  of  interest  is  p^,  i=l,...,k.  Hence,  if 
on?  draws  an  item  at  random  from  P  then  the  probability  is 

k 


that  n  success  is  observed.  One  may,  therefore,  simply  assume  that  P  is  a 
dichotomus  population  with  the  proportion,  of  it*uas  with  the  characteristic 
of  interest  equtl  to  p.  Thus,  random  allocation,  froa  this  point  of  view, 
merely  describes  the  process  of  taking  a  random  sample  of  size  n  from  P. 
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A  statistical  analysis  of  this  procedure  will  now  be  developed. 

Let  3L  be  the  random  vector 
r 


V<V' 


•V- 


where 


1  if  r**1  item  is  drawn 

from  and  a  success  is 

obtained 

0  otherwise 


1=1,..., k i r«l i • • « 

Since  selecting  a  population  is  statistically  independent  from  drawing 
an  item  it  follows  that 


Prob(Xir.l)«f.p.f 


i=l . ,k,r*l, , . . ,n.  Now,  let 


v«n,  , 

r  t.i  ir 


r«l,...,n,  and  note  that  the  events  that  deterrdne  Xr  are  disjoint.  It 
follows  then  that 


k 

Prob(Yr=l)«  Z  fiPi, 
i*l 

r=l,...,n.  That  is,  Yj****»'n  are  independent  Bernoulli  random  variables  with 
mean  p.  Observe,  also,  that 
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1  if  r**1  observation  results 

in  a  success 

0  otherwise 


r=l , . . . ,n. 

The  maximum  likelihood  e^tinate  (MLE)  of  p  will  be  given  in  the  next 
theorem.  It  will  be  shown  later  that  the  MLE  is,  in  fact,  the  "best" 
estimate  of  p  when  data  arise  from  random  allocation. 


Jheorem  3.2 

For  a  total  sample  of  size  n  the  MLE  of  p  from  random  allocation  is 

n 

P_  “  £  y_/n. 
ran  r*l  r 


(3.1) 


i  .e.,  the  MLE  of  p  is  the  mean  number  of  successes  in  the  sample. 
Proof 


>•  i 

i 


I ! 


It  is  straightforward  to  show  that  the  likelihood  function  for  the 

experiment  is 


L'h . v  •  ii  **  ir- 


(3.2) 


Now 


,  note  that  f ipi  may  be  written  as  ^p,  i=l,...,k,  where 


Z  c,*l, 
i«l  1 


Hence,  (3.2)  becomes 


11 


L(*i . V=j,  >)X‘r  (*-p) 


1  -  L  X. 
i-1  lr 


(3.3) 


Thus , 


n  k 

log  L(X.,...,X  }=  1  E  X  log  c.p  +  n  log  (1-p) 
1  n  r=l  i=l 


n  k 

-  E  EX.  log  (1-p). 
r=l  i=l  ir 


Setting  the  derivative  of  log  equal  to  0,  it  follows  that 


n  k 
E  E 
r*l  i= 


i!,  -  (»  -  j,  j.hrJ/H  ’  ° 


Therefore, 


Since 


n  k 

Pr«  Z  1  Xlr/n* 

ran  r*l  i=l  iT 


V*Xir’ 

i®l 


r«=l,...,n,  the  theorem  is  proved. 


A  result  concerning  the  distribution  of  will  be  given  next.  Notice 

rsn 


that  this  result  does  not  depend  on  k. 


Theorem  3.3 


The  distribution  of  nprftn  is  Binomial  with  parameters  (n,p). 

t.«.,  Prob(prln™M>”’ci-p)n-'\ 


m=0,l , . . . ,n. 
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Proof 


Since  Yj,...,Yn  are  independent  Bernoulli  random  variables  with  mean  p  then 

n 


is  a  Binomial  random  variable  with  parameters  (n,p).  The  theorem  is  proved 
because 


n 

np„  «  l  Y  . 

ran  _ ,  r 

r=l 

Some  optimum  properties  of  prjm  are  given  in  the  following  theorem. 

Theorem  3.4 

The  MLE  pran  of  p  has  the  following  optimum  properties: 

(i)  sufficient  (i.e.,  uses  all  the  information  available  from  random 

allocation  in  estimating  p). 

(ii)  unbiased  (i.e.,  the  expectation  of  pran  is  p). 

(iii)  efficient  (i.e.,  has  the  smallest  variance  among  all  unbiased  estimates 

of  p  ob-.ained  from  random  allocation). 

(iv)  strongly  consistent  (i.e.,  p  will  always  converge  to  p  as  n  becomes 

XaTl 

large) . 

(v)  converges  in  (i.e.,  Prob(|prgn-p|<e)-^l  as  n-*»»,  for  any 

probability  e>0). 
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Proof 


(i)  By  (3.3)  the  likelihood  function  may  be  written  as 


L(X1#...,Xn)»A»B 


where 


k  np. 
A=  n  c. 
i*l  1 


ran 


and 


ran 


n-np 

(1-p) 


ran 


This  factorization  is  a  necessary  and  sufficient  condition  to  insure  that 
p  be  sufficient.  (See  Reference  2,  Section  5.5.1.) 

iflll 

(ii)  This  follows  directly  from  Theorem  3.3. 


(iii)  One  can  show  that  the  likelihood  function,  given  by  (3.3),  implies  that 
the  information,  I(p),  in  the  sample  (Reference  2,  6.1.3)  equals 

n(pCl-p)]’1. 

The  Cramer-Rao  lower  bound  for  an  unbiased  estimator  of  p  (Reference  2,  6.1.3) 
is 

1 1  (P)  ]  =  n^pfl-p). 

But  by  Theorem  3.3 

VAR(pran)-n“lP(l-P). 
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But 


VAR(Prgn)  ■  P(l-p)/n  1  l/4n. 


since  0  <  p  <  1.  Thus , 


Prob(|prftn-p|>e)  £  l/4ne2 


or  equivalently. 


Prob(!pran-p|<e)  >.  l-l/4ne2. 

Another  result,  independent  of  k.  which  is  of  practical  significance  for 
large  sample  size  is  the  following. 

Theorem  3.6 

Let  n  be  the  total  sample  size  and  let  q-l-p.  Then  the  asymptotic 
distribution  of 

P  -p 
ran  v 

is  Normal  (0,1),  as  n  — ►  ~. 

Proof 

This  is  a  direct  result  of  the  Central  Limit  Theorem  (Reference  3,  2c. 5). 
Remark 

Whereas  Definition  3.1  specifies  that  an  item  be  drawn  immediately  after 
the  population  is  selected  this  is  not,  in  practice,  necessary.  In  practice 
random  sample  sizes  nj,...,T^,say,  may  be  determined  first,  where  n^^  is  the 
number  of  items  to  be  drawn  from  and 

k 

n»  Z  n. . 
i«l  1 

■  fter  these  sample  sizes  are  determined,  samples  are  drawn  from  the  respective 
populations  and  prfln  is  obtained. 
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4.  OPTIMUM  AND  PROPORTIONAL  ALLOCATION 

If  independent  random  samples  are  taken  from  each  of  the  k  populations 
it  seems  natural  to  estimate  p^  by  the  mean  number  of  successes  in  the  sample 
from  P. ,i=l ,. . . ,k.  Moreover,  the  convex  combination  of  these  estimates  would, 
also,  seem  the  natural  function  to  use  as  an  estimate  of  p  since  the  weights 
fj,...,f^  are  assumed  known.  This  approach  to  estimating  p  is  considered  in 
this  section. 

For  a  fixed  total  sample  size  n  the  question  arises  as  to  how  one  should 
assign  the  sample  sizes  n^,...,^,  where  is  the  size  of  the  sample  to  be 
drawn  from  P. ,i=l , . . . ,k.  This  section  discusses  two  procedures  for  determining 
these  sample  sizes  so  as  to  obtain  estimators  of  p  which  have  certain  desirable 
properties.  It  is  t-  be  remarked  that  these  procedures  are  analogus  to 
optimum  and  proportional  sampling  methods  from  a  finite  population. 

A 

Let  p.  be  the  moan  number  of  successes  in  a  sample  of  size  n^  taken  from 
l\  ,i=l,...,k.  The  corresponding  estimator  for  p  is  then 


P.  i  fiPl. 

i*l 


Now,  p.^  is  the  MLE  of  pj,i«l,...,k.  Further,  Pj  is  an  unbiased  estimator 
of  pi,i=l,... ,k,  and,  also,  enjoys  having  the  smallest  variance  of  all  such 
estimators.  In  general,  unbiatme88  and  minimum  variano8  are  very  desirable 

A 

properties  for  an  estimator  to  possess.  By  virtue  of  the  fact  that  p.  is  an 

A  X 

unbiased  estimator  it  follows  that  p  is,  also,  an  unbiased  estimator  since 


E(p).  I  f,E(p.).p. 
i»l  1  1 


Observe,  also,  that 


VAR (p) 


i-l  1 


VAR(Pi)  =  i^ffpi(l-pi)/ni< 


(4.1) 
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From  the  above,  it  is  seen  that  the  variance  of  p  depends  directly  on  the 
sample  sizes  n. ,i«l,...,k,  which  are  chosen  by  the  experimenter.  The  problem 
of  allocating  the  n^  to  insure  that  p  will  have  the  smallest  possible  variance 
will  be  considered  next. 

Theorem  4.1 

A 

For  a  total  sample  size  n,  VAR (p)  is  minimized  if 


where 


ni*win* 


w. 

1 


£  ^ [Pj (1-Pi )] 1/2 
j=l  1  J  J 


(4.2) 


i=l ,  < . . , 


k. 


Proof 

A 

Minimizing  VAR(p),  given  by  (4.1),  subject  to  n1+...+nk»n  is  equivalent 
to  minimizing 

k 

E  f|pi(l-pi)/ni+X(n1+n2+«»»+nk-n)  (4.3) 


where  X  is  a  Lagrange  multiplier.  Taking  the  derivative  of  (4.3)  with  respect 
to  nj  and  setting  this  equal  to  0  yields 

-f?Pi(l-pi)/n?-*-X=0.  (4.4) 

Thus, 

Vfi[Pi(1-Pi)]1/2A1/2,  i*l»....k.  (4.5) 
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Since 


k 

n=  Z  n . 

j=l  J 


Equation  4,S  implies  that 

1/2  k  1/2 

*  ’  »  £  fjlP^l-Pj)]  7  /n. 
j=l  '  ' 

Hence,  w^,  given  by  (4.2),  is  immediate  for  i«l,...,k. 

Definition  4 >2 

Let  n  be  the  total  sample  size,  When  n.=w.n,  w.  given  by  (4.2), 

*  i  1  a 

i=l,...,k,  the  procedure  is  called  optimum  allocation  and  p  is  denoted  by 

^opt* 


The  weights  Wj  may  or  may  not  be  known  in  practice.  If  they  are  not 
known  then,  of  course,  optimum  allocation  cannot  be  employed.  Another 
allocation  method  which  is  always  feasible  in  practice  is  defined  next. 


Definition  4.3 


Let  n  be  the  total  sample  size.  When  n.=f.n,  i=l,...,k,  the  procedure 

A  11 

is  called  pvovortional  allocation  and  p  is  denoted  by  p 

*prop 

In  the  next  section  a  comparison  of  the  three  allocation  procedures 


presented  in  this  report  will  be  made.  This  comparison  will  be  based  on  a  few 
important  factors  which,  it  is  felt,  should  be  taken  into  account  when 
choosing  among  these  procedures.  One  point  of  interest  will,  of  course,  be 
the  relative  precision  of  the  corresponding  estimators. 


Now,  by  the  way  optimum  allocation  was  defined  it  is  known  that 


VAR<Popt>  i 


VAR<W- 
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suiinitfijifitt 


The  relationship  between  the  variances  of  p  and  p  will  be  considered 

prop  rran 

next. 

Theorem  4.4 

Let  n  be  the  total  sample  size.  Then 

Proof 

This  result  follows  easily  since 

™(V°p1'i!;1fipi(1'pi)/v 

and 

n^f^n,  i=l,. . .  ,k. 

Theorem  4.5 

VAR(Pprop)  i  VARtp  ran^ 

Proof 

Let  n  >  k  be  the  total  sample  size.  By  (4.6) 

VARWi!1  fipitl-pi)/n 

and  by  Theorem  3.3 

VARCpran5"P(1'p)/n» 
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sgggnTCsggg^r 


where,  of  course. 


Observe,  now,  that 


P*  z  fjPi 
i«l  1  1 


k 


VAR^rop)'i«1fiPiC,"P+P”Pi)/n 


K  * 

=  Z  f^a-pj/n-  Z  fipi(p-pi)/n 
i«l  1  i**l 


-  VAR (Pran)+  1  ^PjCp-Pi)/®* 
i-»l 


It  will  next  he  shown  that 


To  show  this,  consider  a  randon  variable  1  which  assuaes  the  values 

pl,“*’Pk*  30(1  SUPP°*°  t*iat 

Prob(Z»?i)»f, , 

i^l Recall  here  that 


Observe,  now,  that 


l  f.»l. 
i*l  1 


(4.7) 
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*  -  VAR (2). 


Since  VAR(Z)  is  nonnegative  (4.7)  is  established  and,  hence,  the  theorem 
is  proved. 

The  next  theorem  gives  two  asymptotic  properties  of  p  *  and  p  as  the 

*opt  rprop 

total  sample  size  increases. 


Theorem  4.6 

The  estimators  p^and  Ppr0p  of  p  have  the  following  properties: 

(i)  strongly  consistent  (i.e.,  P0pt ^Pprop^  always  converge  to  p  as 

n  becomes  large). 

(ii)  converges  in  (i.e.,  Prob(*-i-p|<e)-^-  1  as  n  -*•  •*,  for  any  e>0 
probability  P-Popt.PpropJ. 

Proof 

The  proof  is  similar  to  the  one  given  for  Theorem  3.4  (iv)  and  (v)  and 
is,  therefore,  omitted. 


In  Theorem  3.5  a  result  was  given  that,  in  practice,  would  aid  an 
experimenter  to  assess  the  confidence  he  may  place  in  p  as  an  estimator 
of  p  for  a  certain  sample  size  n.  The  key  to  the  result  was  the  fact  that 


vAR(Prgn)>‘P(l-p)/n  l/4n. 


This  inequality  is,,  in  fact,  sharp;  i.e.,  it  cannot  be  improved  on.  In  this 
section  it  has  been  show  that 


mt*V>  i  VAR<P„r»a)  iTOR(P,=J. 


prop' 


rran' 
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with  equality  if  pj  *  p2 


•  ••  w  • 


Consequently,  l/4n  is  an  upper  bound  ofi  the  variances  of  pQpt  and  Ppr0p 
and  the  bound  is,  also,  sharp.  The  significance  of  the  following  theorem 
is,  therefore,  established. 


Theorem  4.7 


For  any  e>0  and  for  any  n>k 


Prob(|p-p|<e)  l-l/4ne2. 


P*Popt*  Pprop* 


Proof 


The  proof  is  omitted  because  of  its  similarity  to  the  proof  of  Theorem  3.5. 

Another  result  which  may  be  useful  when  the  sample  size  n  is  large  will 
be  given  next. 

Theorem  4.3 

tet  n  be  the  total  sample  size.  Then  the  asymptotice  distribution  of 


[VARfe)]*/* 

is  Normal (0,1),  as  n-*»-,  where  p  -  pQpt,  pprop. 

Proof 

This  result  follows  immediately  from  the  Central  Limit  Theorem  (Reference  3, 
2c. 5). 
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5.  SELECTION  OF  A  SAMPLING  PROCEDURE 

In  this  section  some  of  the  practical  aspects  of  choosing  among  the  three 
sampling  procedures  presented  in  this  report  shall  be  discussed.  This 
discussion  may,  also,  serve  as  a  brief  review  of  some  of  the  basic  properties 
of  these  procedures. 

It  is  difficult,  if  not  impossible,  to  consider  all  possible  situations 
an  experimenter  mcy  face  when  selecting  a  sampling  procedure.  It  may,  also, 
be  difficult  to  ascertain  which  procedure  is  the  "best"  to  use  in  practice 
for  a  particular  situation.  It  may  happen  in  practice  that  a  procedure  is 
,fbsst"  only  because  of  its  computational  simplicity  or  possibly  because  the 
theoretical  aspects  are  easy  to  explain  to  ones  employer.  The  following 
discussion,  however,  is  general  in  nature  and  is  intended  to  serve  only  as 
a  guide  when  selecting  a  procedure. 

Regardless  of  any  other  considerations,  the  experimenter  must  usually 
make  at  least  two  basic  decisions  before  he  selects  a  sampling  procedure. 
Firstly,  he  must  decide  whether  the  chief  purpose  of  the  experiment  is  to 
estimate  p  or  to  test  a  hypothesis  on  the  true  value  of  p.  Secondly,  he 
must  decide  whether  or  not  the  sample  size  n  will  be  larger  than  k,  the 
number  of  populations.  The  choice  of  a  sampling  procedure  for  these  four 
basic  situations  shall  he  discussed. 


Suppose  first  that  it  is  decided  that  k>n.  Immaterial  of  whether  the 
experimental  goal  is  estimation  or  hypothesis  testing,  the  only  appropriate 
procedure  to  use  is  random  allocation.  Hypothesis  tests  are  based  on  the 
statistic  i»Pran»  which  by  Theorem  3.3,  has  a  Binomial  distribution. 

Consider,  now,  the  case  when  k<n  and  the  purpose  of  the  experiment  is  to 
estimate  p.  For  most  problems  encountered  in  practice  it  happens  that 

VARCpopt)  <  V*R(pprop)  <  VARCp 

ran^* 

Since  the  estimator  with  the  smallest  variance  is  generally  considered  to  be 
best  it  follows  that  optimum  allocation  should  be  used  if  possible.  The 
weights  w^,ial,...,k,  given  by  (4.2)  may  not,  of  course,  be  known  in  practice. 
If  this  is  the  case  then  proportional  allocation  is  the  appropriate  sampling 
procedure. 
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Finally,  assume  that  k<n  and  the  experimenter  wishes  to  test  a  hypothesis 

on  the  true  value  of  p.  In  this  case  a  statistic  with  a  known  distribution 

that  depends  on  p  must  be  available.  Distributions,  in  a  closed  form,  of 

statistics  obtained  from  optimum  and  proportional  allocation  are  not  known 

to  the  author.  However,  by  Theorem  3.3,  the  distribution  of  np„  is  Binomial 

'ran 

with  parameters  (n,p).  It  appears  then  that  random  allocation  is  generally 
the  appropriate  procedure  to  use  under  these  circumstances.  It  is,  also, 
remarked  that  if  n  is  very  large  relative  to  k  then  the  distributions  of 
Popt  and  Ppr0p»  properly  normalized,  are  approximately  normal  with  mean  zero 
and  variance  one.  Unfortunately,  the  normalizing  factors  are  functions  of  the 
individual  pi,i=l,...,k,  and  not  explicitly  a  function  of  p.  It  is  further 
noted  that  if  n  is  very  large  relative  to  k,  then  proportional  allocation  and 
random  allocation  are  approximately  the  same  procedure.  That  is,  the  sample 
sizes  to  be  drawn  from  the  individual  populations  are  approximately  equal. 

6.  NUMERICAL  EXAMPLES 

In  this  section  several  numerical  examples  shall  be  given  which  will 
demonstrate  the  practical  significance  of  the  three  sampling  procedures 
presented  in  this  paper.  Before  these  examples  are  given,  it  should  be 
pointed  out  that  if  one  is  sampling  from  a  dichotomus  population  and  the 
probability  of  a  success  is  equal  to  a  fixed  constant  for  all  trials,  then 
the  population  is  infinite.  This  observation  will  be  useful  in  the  following 
examples . 

Example  1 

At  the  end  of  the  development  period  of  a  certain  surface-to-air 
missile  system,  the  Army  wishes  to  conduct  tests  that  will  demonstrate,  at  an 
appropriate  significance  level,  a  particular  performance  requirement  of  the 
system.  This  requirement  is  not  concerned  with  the  detection  of  threatening 
aircraft  but  only  with  the  performance  of  the  missile  from  launch  to 
detonation  of  the  warhead. 

This  missile  is  designed  to  destroy  air-breathing  aircraft  and  several 
factors  such  as  target  speed  and  range,  use  of  electronic  counter  measures 
and  others  are  assumed  to  affect  this  capability.  These  factors,  taken 
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together  in  various  combinations  yield  a  total  of  52  possible  states  that  a 
threat  may  assume.  In  the  event  of  an  enemy  attack  the  probability  that  a 
missile  will  encounter  an  aircraft  presenting  a  particular  threat  is  known. 
However,  it  is  not  known  what  the  probability  is  that  a  missile  will  destroy 
an  aircraft  given  that  the  aircraft  is  presenting  a  certain  threat. 

The  unconditional  probability  that  a  missile  will  destroy  an  aircraft 
is  called  the  expected  "kill  probability  (EKP)  and  it  is  required  that  testing 
demonstrate  that  this  probability  is  sufficiently  high  before  the  production 
phase  of  the  missile  system  is  begun. 

Suppose  that  the  52  possible  states  a  threatening  aircraft  can  assume 
are  numbered  and  let  be  the  known  probability  of  state  i,  i=l,...,52. 
Further,  let  p,  be  the  unknown  probability  that  a  missile  will  destroy  an 
aircraft  imposing  a  threat  in  the  i  state,  i=l,...,52.  From  this  notation 
it  follows  that  the  EKP  is 


Because  of  cost  only  40  missiles  can  be  allocated  for  the  testing. 
Also,  since  40,  the  sample  size,  is  less  than  52,  the  number,  of  possible 
threats,  and  hypothesis  testing  is  the  goal,  then  the  random  allocation 
procedure  is  appropriate.  In  general  terms  the  null  hypothesis,  H^,  and  the 
alternate  hypothesis,  HA,  are  defined  as: 

EKP  is  acceptable 

WpX  EKP  is  not  acceptable. 

The  decision  to  accept  or  reject  is  to  be  based  on  the  statistic 

40p  ,  which,  by  Theorem  3.3,  has  a  Binomial  distribution  with  parameters 

ran 

(40, p).  It  is  decided  that  if  40prgJJ  is  at  least  25  then  is  accepted 
while  if  40pran  is  less  than  25  then  is  rejected  in  favor  of  HA«  The 
corresponding  operating  characteristic  (OC)  curve,  which  is  based  on  Binomial 
probabilities,  is  given  in  Figure  6.1. 
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For  purposes  of  a  numerical  example  the  missile  tests  were  simulated  on  a 
computer  using  the  values  of  the  f^s  and  p1»s  given  in  Table  6.1  (Note  that 
in  practice  the  f^s  are  known  but  the  p^s  are  unknown).  The  simulation 
produced  first  the  number  of  missiles  to  be  fired  at  aircraft  presenting  the 
different  threats.  These  numbers,  denoted  by  ni#  i*l,...,52,  are  given  in 
Table  6.1.  Next  the  missile  firings  were  simulated  and  S^,  i»l,...,52,  given 
in  Table  6.1,  are  the  number  of  successful  firings.  This  simulation  yielded 
a  value  of  30  for  4fy>ran.  Therefore,  the  null  hypothesis,  H^j,  should  be 
accepted  implying  that  the  EKP  is  acceptable. 
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TABLE  6.1  INFORMATION  AND  SIMULATED  DATA  PERTAINING  TO  HYPOTHESIS 
TEST  ON  THE  EKP 


Threat 

i 

Frequency 

Destroy 

Probability 

Pi 

Sample 

Size 

n. 

l 

Successful 

Firings 

Si 

1 

.0576 

.72 

3 

3 

2 

.0144 

.60 

1 

1 

3 

.0384 

.60 

1 

0 

4 

.0384 

.72 

2 

1 

5 

.0096 

.60 

0 

0 

6 

.0256 

.60 

2 

2 

7 

.0096 

.55 

2 

2 

8 

.0064 

.55 

0 

0 

9 

.01728 

.72 

2 

1 

10 

.00432 

.60 

0 

0 

11 

.06912 

.60 

2 

6 

12 

.01152 

.72 

1 

i 

13 

.00288 

.60 

0 

0 

14 

.04608 

.60 

2 

2 

15 

.01728 

.55 

1 

0 

16 

.01152 

.55 

0 

0 

17 

.0192 

.72 

0 

0 

18 

.0048 

.60 

0 

0 

19 

.0128 

.72 

0 

0 

20 

.0032 

.60 

0 

0 

21 

.0480 

.72 

3 

3 

22 

.0120 

.60 

0 

0 

23 

.0120 

.60 

0 

0 

24 

.1440 

.72 

4 

4 

25 

.0360 

.60 

2 

2 

26 

.0360 

.60 

1 

1 

27 

.0030 

.55 

0 

0 

28 

.0090 

.55 

0 

0 

TABLE  6.1  (CONTINUED) 


Threat 

i 

Frequency 

fi 

Destroy 

Probability 

Pi 

Sample 

Size 

ni 

Successful 

Firings 

Si 

29 

.0396 

.72 

2 

2 

30 

.0099 

.60 

0 

0 

31 

.0044 

.60 

0 

0 

52 

.1188 

.55 

6 

2 

33 

.0297 

.55 

1 

1 

34 

.0132 

.72 

1 

1 

35 

.0011 

.60 

0 

0 

36 

.0033 

.72 

0 

0 

37 

.0020 

.72 

0 

0 

58 

.0005 

.55 

0 

0 

39 

.0060 

.72 

0 

0 

40 

.0015 

.55 

0 

0 

41 

.0016 

.72 

0 

0 

42 

.0004 

.69 

0 

0 

43 

.0144 

.72 

0 

0 

44 

.0036 

.60 

0 

0 

45 

.0016 

.72 

0 

0 

46 

.0004 

.60 

0 

0 

47 

.0144 

.72 

1 

1 

48 

.0036 

.60 

0 

0 

49 

.0008 

.72 

0 

0 

50 

.0002 

.55 

0 

0 

51 

.0072 

.72 

0 

0 

52 

.0018 

.55 

0 

0 

52 

EKP  «  Z  f.p,  «  0.643883 
i»l 

52 

n  «  Z  n.  »  40 
i«l  1 

52 

np  =  z  s. 

Pran  i-1  1 

r  30 

30 


Example  2 

Consider  again  the  missile  testing  problem  discussed  in  Example  1. 
Suppose  now  that  instead  of  hypothesis  testing  the  Army  wishes  to  estimate 
the  EKP  from  data  obtained  from  random  allocation.  How  large  should  the 
sample  size  n  be  in  order  for  the  Army  to  have  an  a  priori  probability  of  at 
least  0.95  that  the  estimator  pyjm  will  differ  from  the  true  EKP,  p,  by  less 
than  0.1?  That  is,  what  value  of  n  will  insure  that 

ProbClPran  *  Pi  <  >  °-9s? 

It  is  known  from  Theorem  3.3  that  np _  has  a  Binomial  distribution 

ran 

with  parameters  (n,p).  Hence, 

Prob(|prjm  -  p]  <  0.1)  «  Prob(|nprjm  -  np|  <_0.1n) 

.  Z  (n\vx  qn“x 

|x-np|  <  O.lnW 


where 


q  «  1-p  and  the  summation  is  extended  over  the  values  of  x  for  which 
fx  -  np|  <_  O.ln.  One  can  see  the  difficulty  in  attempting  to  determine 
directly  an  n  so  that  no  matter  what  p  is  the  inequality 


Z 

jx  -  np|  <_  O.ln 


(i) 


x  n-x 
P  <1 


>  0.95 


holds.  By  using  the  results  of  Theorem  3.5  one  may,  however,  easily  find 
such  an  n.  Now,  by  this  theorem 

PwMlPran  "  Pi  <  °a)  -  1-1/C0.1)2  4n. 


Therefore,  if 


1-1/ (0.1)2  4n  >0.95 

31 


: .  Jg&af 


then  it  follows  that 

n  >_  25/0.05  »  500. 

Consequently,  if  500  or  more  missiles  are  fired  then  the  a  priori  probability 
is  at  least  0.95  that  pran  will  differ  from  the  (unknown)  EKP,  p,  by  less 
than  0.1. 

Inasmuch  as  the  above  result  is  correct,  it  will  next  be  shown  that 
a  much  smaller  sample  size  n  will,  also,  suffice.  The  derivation  of  the 
smaller  sample  size  will  be  based  on  the  fact,  given  in  Theorem  3.6,  that 
p  is  approximately  normally  distributed  if  n  is  sufficiently  large.  From 
practice,  it  has  been  observed  that  a  sample  size  of  30  or  more  is  generally 
sufficiently  large  to  insure  that  this  approximation  is  adequate.  Hence,  if 
n  >  30  then 


Pr°MlPran  -  pi  <  0*1) 


c 


< 


) 


£ 


dt 


where 


=  (pq/n^ 


V2 


and  denotes  "approximately  equal  to.’ 


From  a  table  of  cumulative  standard  normal  probabilities  one  finds 


that  if 


O.l/o  >  1.96 
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Hence,  if 


TABLE  6.2  SIMULATED  DATA  PERTAINING  TO  THE  ESTIMATION  OF  THE  EKP 
FROM  RANDOM  ALLOCATION 


It  has  been  noted  before,  and  is  well  worth  mentioning  again,  that  the 
statistical  properties  of  pr&n  are  independent  of  k,  the  number  of  dichotomus 
populations.  This  means  that  a  sample  size  of  40  (96)  would  still  yield  the 
same  OC  curve  (precision)  in  Example  1  (2)  even  if  the  number  of  possible 
threats  was  not  52  but  any  arbitrarily  large  number.  Observe,  also,  that 
the  number  of  threat  states  actually  sampled  from  was  only  20  in  Example  1 
and  only  31  in  Example  2. 

Example  3 

Suppose  a  small,  inexpensive  item  is  produced  by  a  certain  factory 
for  the  Army  and  the  Army  wishes  to  estimate  the  fraction  of  defective  items 
produced  from  a  total  sample  size  of  1000.  Three  assembly  lines  contribute 
to  the  factory1?,  production  of  the  item  and  because  of  differences  in 
machinery  and  personnel  the  fractions  of  defective  items  produced  by  the 
assembly  lines  differ.  However,  after  production  the  items  are  stored 
together  and  are  not  distinguished  as  to  the  particular  assembly  line  which 
produced  them. 

Ordinarily,  a  random  sample  of  1000  items  would  be  taken  from  the 
storage  area  and  the  fraction  of  defective  items  produced  by  the  factory 
would  be  estimated  by  the  mean  number  of  defective  items  in  the  sample.  If 
the  proportions  f?,  f_,  say,  of  the  items  produced  by  Assembly  Lines  1, 

2  and  3,  respectively,  are  known  then  the  random  allocation  procedure  may,  also, 
be  used  to  estimate  the  fraction  of  defective  items  produced.  Any  additional 
effort  or  cost  it  may  take  to  implement  the  random  ax  location  procedure  would 
probably  not  be  justified  since,  in  Section  3,  it  was  noted  that  this  pro¬ 
cedure  and  the  random  sampling  procedure  are  statistically  equivalent.  Thus, 
it  is  immaterial  which  of  the  two  procedures  is  used  because  the  respective 
estimators  afford  the  same  precision. 

Assuming  that  the  f^'s  are  known  the  Army  may,  also,  use  the  pro¬ 
portional  allocation  procedure  since  1000,  the  sample  size,  is  greater  than  3, 
the  number  of  assembly  lines.  This  procedure,  it  has  been  pointed  out, 
generally  yields  a  more  precise  estimator  than  does  random  allocation.  Becau-e 
the  fj's  are  easily  obtainable  from  the  manufacturer  the  Army  decided  to  use 
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the  proportional  allocation  procedure  and  take  advantage  of  this  extra 
precision  in  estimating  the  fraction  of  defective  items  produced. 

This  estimation  problem  was  simulated  on  a  computer  when  Assembly 
Line  1  produces  items  5  percent  defective  and  accounts  for  25  percent  of  the 
production;  Assembly  Line  2  produces  items  4  percent  defective  and  accounts 
for  35  percent  of  the  production,  while  Assembly  Line  3  produces  items  3 
percent  defective  and  accounts  for  40  percent  of  the  production.  Of  course, 
the  fraction  of  defective  items  produced  by  each  assembly  line  would  be 
unknown  in  practice. 

The  simulation  results  are  given  in  Table  6.3,  where  n^  is  the  number 
of  items  to  be  drawn  from  Assembly  Line  i  and  is  the  number  of  defective 
items  in  the  sample,  i*l,  2,  3.  The  actual  fraction,  p,  say,  of  defective 
items  produced  by  the  factory  is 

p  «  0.0385. 

The  estimate  of  p,  resulting  from  the  simulation,  is 

P„„_  *  0.0330. 
prop 


TABLE  6.3  INFORMATION  AND  SIMULATED  DATA  PERTAINING  TO  THE  ESTIMATION  OF  THE 
FPACTION  DEFECTIVE  FROM  PROPORTIONAL  ALLOCATION 


Assembly 

Line 

i 

Proportion  of 

Production 

f. 

i 

Fraction  of 
Items  Defective 

Pi 

Sample 

Size 

ni 

Number  of 
Defectives 

S. 

l 

1 

0.250 

■  • 

250 

7 

2 

'  -  "1 

■ 

350 

16 

5 

400 

10 

3 

p  *  E  f.p. 
i*l 

=  0.0385 

3 

n  =  E  n. 
i*l  1 

3 

P„„  =  E  f.S./n. 
ProP  i=1  i  i  i 

=  1000 

*  0.0330 

Lxamplo  4 


Consider  again  the  estimation  problem  discussed  in  Example  3.  Suppose 
ti.c  manufacturer  informs  the  Army  chat  from  past  experience  with  the  manu¬ 
facturing  of  similar  type  items  the  production  pro  esses  of  the  three  assembly 
lines  usually  exhibit  a  certain  relationship.  I’articularly,  the  variation  of 
thu  production  process  for  Assembly  Line  2  is  generally  about  20  percent  less 
than  the  variation  of  the  production  process  for  Assembly  Line  1.  Also,  the 
variation  of  the  production  process  for  Assembly  Line  3  has  been  observed  to 
usually  he  about  40  percent  less  than  the  corresponding  variation  for  Assembly 
Line  1.  The  differences  in  the  age  and  design  of  the  machinery  in  the  three 
assembly  line.-  were  given  as  the  primary  reason  i'or  the  differences  in  the 
production  processes.  With  this  information  the  Army  may  use  the  optimum 
allocation  procedure  to  estimate  the  fraction  of  defective  items  produced 
by  the  factory  and,  thereby,  obtain  the  best  possible  precision. 

Again  let  f.  be  the  proportion  of  the  production  contributed  to 
Assembly  Line  j  and,  also,  let  p.^  be  the  fraction  of  the  items  produced  by 
Assembly  Line  i  which  are  defective,  i=l,  2,  5.  for  a  total  sample  size  n 
it  fellows  from  Theorem  4.1  that  the  sample  size  n^  to  be  taken  from  Assembly 
Line  i  is 


(6.1) 


where 


l-  fjiPjU-iyr 


from  remarks  made  in  Section  2  it  is  known  that  if 


^  ■■ 


if  an  item  drawn  at  random  from 
Assembly  Line  i  is  defective 

otherwise 


then  ECiy  *  p.  and  VAR(U.)  =  p.(l-p.),  1=1,  2,  3. 


Hence, 


[VAR(U.)] 


w.  * - - — 

1  3  l/0 

E  f.  [VAR(U.)]  /2 
j*l  ^ 


i  «  1,  2,  3. 


Also,  from  the  information  provided  by  the  manufacturer  it  is  known 
that  VAROn  is  approximately  equal  to  0.8  VAR(U  )  and  VAR(U  )  is  approximately 
equal  to  0.6  VARQ^).  Regarding  these  as  actual  equalities  one  has  that 


n  =£C0.»),/2n 
2  *  1 


(6.2) 


n  *  (0.6)  n  . 

3  *  1 


(6.3) 


Therefore,  for  a  total  sample  size  of  n  it  follows  that 


n  *  [1  ♦  j2*  (0.8) 1/2  ♦  j2.  (0.6) 1/2 ]  n  . 
1  1  1 
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Consequently, 


n 

l 


=  n/[i  +%.  (0.8)1/2 
*1 


(0.6) 1/2] . 


(6.4) 


If,  in  fact 


VAR(Uz)  -  0.8  VARCU^) 


(6.S) 


and 


VAR(U3)  «  0.6  VAR(Ui)  (6.6) 

then  the  sample  sizes  given  by  (6.2)  -  (6.4)  are  optimum  for  a  fixed  total 

sample  size  n.  If  the  variances  of  U  and  U  are  given  only  approximately 

2  3 

by  the  right-hand  sides  of  (6.5)  and  (6.6),  respectively,  then  the  sample 

sizes  n  ,  n  and  n  are,  of  course,  only  marly  optimum. 

12  3 

For  a  total  sample  size  of  n  »  1000  this  estimation  problem  was 

simulated  on  a  computer  using  the  same  f^’s  and  pi's  given  in  Example  3.  The 

exact  relationships  between  the  variances  of  U  ,  U  and  U  are  determined 

12  3 

from  p  p  and  p  and  are: 

12  *3 

VAR(ty  «  0.808  VAR(iy  (6.7) 

and 

VAR(U  )  »  0.613  VAR(U  ).  (6.8) 

3  1 

For  the  simulation  the  sample  sizes  u  ,  n  and  n  were  determined 

12  3 

assuming  that  the  relationships  (6.5)  and  (6.6)  hold.  The  resulting  sample 
sizes  are: 
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-  286; 

(6.9) 

«  359; 

(6.10) 

=>  355. 

(6.11) 

These  sample  sizes  are  only  nearly  optimum.  The  true  optimum  sample  sizes, 
based  on  liquation  (6.1),  are: 


n  *  285; 

n  =  358; 

n  =•  357. 
3 


Clearly,  then,  the  sample  sizes  given  by  (6.9)  -  (6.11)  may,  for  all  practical 
purposes,  be  considered  optimum. 

The  simulation  results  are  given  in  Table  6.4,  where  ^  is  the  number 
of  items  to  be  drawn  from  Assembly  Line  i  and  S.  is  the  number  of  defective 
items  in  the  sample,  i=l,  2,  3.  From  Table  6.3  the  true  fraction  of  defective 
items  produced  by  the  factory  is 


p  =*  0.0385. 

The  estimate  of  p,  obtained  from  this  simulation, is 

popt  =  °*0335, 
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A 


THE  FRACTION  DEFECTIVE  FROM  OPTIMUM  ALLOCATION 


Assembly 

sample 

Number  of 

Line 

Size 

Defectives 

i 

ni 

Si 

1 

286 

10 

2 

359 

15 

3 

355 

9 

3 

3 

n  •  Z  n. 
i*l  1 

V  ■  £  fi  V»i 

| 

«  1000 

*  0.0335 

!Xt  case  is  b 
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